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Abstract 

Photobacterium profundum is a cosmopolitan marine bacterium capable of growth at low temperature and high hydrostatic 
pressure. IVlultiple strains of P. profundum have been isolated from different depths of the ocean and display remarkable 
differences in their physiological responses to pressure. The genome sequence of the deep-sea piezopsychrophilic strain 
Photobacterium profundum SS9 has provided some clues regarding the genetic features required for growth in the deep sea. 
The sequenced genome of Photobacterium profundum strain 3TCK, a non-piezophilic strain isolated from a shallow-water 
environment, is now available and its analysis expands the identification of unique genomic features that correlate to 
environmental differences and define the Hutchinsonian niche of each strain. These differences range from variations in 
gene content to specific gene sequences under positive selection. Genome plasticity between Photobacterium bathytypes 
was investigated when strain 3TCK-specific genes involved in photorepair were introduced to SS9, demonstrating that 
horizontal gene transfer can provide a mechanism for rapid colonisation of new environments. 
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Introduction 

The vast majority of the earth's marine biosphere is at a 
relatively constant low temperature, high hydrostatic pressure and 
is constrained by the small amounts of refractory organic nutrients 
that arrive in pulses from the overlying photic zone [1]. These 
conditions promote and maintain a diverse microbial community 
as detected by culture-independent approaches [2], [3], [4]. It is 
stiU under debate the extent to which the culture-independent 
diversity is autochthonous [4], but generally pressure-adaptation is 
considered a valid criterion to discriminate against microbes 
recendy introduced to the deep sea from shallower waters [4], [5], 
[6]. With few exceptions [6], [7], [8], [9] the majority of pressure- 
adapted isolates in culture span only a narrow phylogenetic range 
of gamma proteobacteria [10]. This includes the easily culturable 
Photobacterium profundum SS9 that has become a model for studying 
adaptations to high pressure. 

With the completion of the P. profundum SS9 genome sequence 
[1 1], the details of physiological responses to pressure have begun 
to unravel: for example, microarray studies have shown that 
suboptimal hydrostatic pressure induces the up-regulation of 
chaperones and DNA repair enzymes [12], and RNA-seq analyses 
have revealed the differential expression of multiple ATP synthases 
and membrane transporters [13]. 



At least four separate strains of P. profundum (SS9, DSJ4, 3TCK 
and 1230sfl) have been isolated and characterized from multiple 
sites in the Pacific ocean [10], [14], [15], [16]. While strains SS9 
and DSJ4 were isolated from deep-sea environments and are 
adapted to high hydrostatic pressure [14], [15], strains 1230sfl 
and 3TCK were recovered from shallower waters and are 
inhibited by elevated hydrostatic pressure (12 and our unpublished 
results). These different ecotypes, which vary in their adaptation to 
depth in the water column, have been defined bathytypes [5] . The 
very existence of phylogeneticaUy cohesive bathytypes suggests 
that the genetic modifications required to evolve depth-specific 
adaptations can be rapidly evolved [10] and bathytype conversion 
might occur quite frequently as a result of advective transport of 
microbial communities by phenomena such as up/ downweUing, 
Ekman transport and thermohaline circulation [10], [17]. The 
rapid development of mutants tolerant to pressure inactivation, 
which has been found to extend into the gigapascal range even for 
non-marine bacteria [18], might aid specific taxa in rapidly 
adapting to the new environmental conditions while restricting 
others. 

In this study the genome plasticity between two bathytypes of P. 
profundum is analysed in detail. The results show that no single gene 
is likely to restrict the environmental niche (sensu Hutchinson [19]) 
of each strain, but a number of genetic features specific to each 
strain can confer specific abilities to cope with depth-specific 
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environmental stresses (e.g. temperature, pressure, nutrient avaH- 
abUity). Some of these features carry signatures of horizontal gene 
transfer (HGT) suggesting one possible mechanism for the rapid 
evolution of new bathytypes. 

To test the feasibility of bathytype conversion, a cluster of genes 
involved in the repair of ultraviolet light-induced DNA damage 
were transferred from the shallow to the deep bathytype. The lack 
of this UV protective function is predicted to restrict the 
colonization of shallower waters by deep bathytypes. This is the 
first study employing intra-specific sequence comparisons in 
combination with molecular genetics to address the bases of niche 
partitioning in piezophilic bacteria. 

Materials and Methods 

Strains and Growth Conditions 

The bacterial strains used in this study are listed in Table 1 . The 
strains of P. profundum were cultured in 75% strength 2216 Marine 
Medium (28 g/1; Difco Laboratories) at 15°C and 0.1 megapascal 
(MPa), unless otherwise specified. E. coli strains were grown 
aerobically at 37°C in Luria-Bertani (LB) medium. High-pressure 
growth experiments were performed by inoculating in heat- 
sealable plastic bulbs containing media and no gas space. The 
heat-sealed bulbs were placed in pressure vessels and pressurized 
as previously described [20], [21]. 

When needed, antibiotics were used in the following final 
concentrations: rifampicin (Rif), 100 |J,g/ml; kanamycin (Km), 
100 M-g/ml [E. colt) or 200 |J.g/ml (P. profundum); streptomycin (Sm), 
50 (Xg/ml (E. coli) or 150 |J.g/ml {P. profundurri). X-Gal (5-bromo-4- 
chloro-3-indolyl- [beta] -D-galactopyranoside) was added to sohd 
medium at 40 |J,g/ml in N,N-dimethylformamide. The introduc- 
tion of plasmids in P. profundum was achieved by tri-parental 



conjugations using the helper E. coli strain pRK2073 as previously 
described [22]. 

Genome Sequencing, Assembly and Annotation 

Genomic DNA was obtained from a culture of P. profundum 
strain 3TCK in mid-exponential growth. Approximately 1 liter of 
a liquid culture was harvested by centrifugation for 1 5 minutes at 
5,000 xg and the pellet was resuspended in 5 ml buffer A (50 mM 
Tris, 50 mM EDTA, pH 8.0). The suspension was incubated 
overnight at — 20°C and thawed at room temperature with the 
addition of 500 III of buffer B (250 mM Tris, pH 8.0, 10 mg/ml 
lysozime). After 45 min of incubation on ice, 1 ml of buffer C 
(0.5% SDS, 50 mM Tris, 400 mM EDTA, pH 7.5, 1 mg/ml 
Proteinase K) was added and the mixture was placed in a 50°C 
water bath for 60 minutes. An additional 750 |a,l of buffer C were 
added followed by an additional 30 minutes of incubation at 50°C. 
The genomic DNA was extracted twice with 5 ml of phenol:- 
chloroform:isoamyl alcohol (24:24:1), and precipitated with 0.8 
volumes of isopropanol. The DNA pellet was recovered by 
spooling on a glass rod, and rehydrated overnight at 4°C in 4 ml of 
buffer D (50 mM Tris, 1 mM EDTA, 200 ug/ml RNAse A, 
pH 8.0). Further purification was performed by extracting once 
with 4 ml of chloroform, then precipitating with 3.2 ml of 
isopropanol. The DNA pellet was recovered by centrifugation, 
washed once with 5 ml of 70% ethanol and stored dry at — 20°C. 

The dry DNA pellet was shipped to the J. Craig Venter Institute 
where a draft genome sequence was obtained witii a conventional 
whole-genome sequencing approach by preparing two genomic 
libraries with insert sizes of 4 kb and 40 kb as described in 
Goldberg et al. [23] and the resulting sequences were used as input 
for the Celera assembler [24]. The draft genome sequence was 
deposited in NCBI under the BioProject accession number 
PRJNA13563. The reference genome sequence of P. profundum 



Table 1. Strains and Plasmids used in this study. 




Strain/plasmid 


Relevant genotype or description 


Reference 


P. profundum 


SS9 


Wild type, deep bathytype of P. profundum 


[14], [15] 


DSJ4 


Wild type, deep bathytype of P. profundum 


[15] 


3TCK 


Wild type, shallow bathytype of P. profundum 


[12] 


SS9R 


Rif SS9 derivative 


[22] 


3TCKR 


Rif 3TCK derivative 


This study 


E. coli strains 


ED8654 


pRK2073 maintenance 


[60] 


DHSa 


recA^ , used for cloning 


[61] 


XLI-Blue 


recA^ , used for cloning 


Stratagene 


TOPIO 


recA^ , used for cloning 


Invitrogen 


Plasmids 


pRK2073 


Providing tra genes for conjugal transfer 


[62] 


pFL122 


RSF1010 derived, broad host range cloning vector, Sm^ 


[37] 


pFL190 


RSF1010 derived, arabinose-inducible expression vector, Sm^ 


[37] 


pFL303 


phr gene cluster in pFL122, Sm"^ 


This study 


pFL304 


A22 deletion of phr gene cluster in pFL122, Sm^ 


This study 


pFL305 


pFL304+phr promoter, Sm"^ 


This study 


pFL306 


pFL304+rpoX+p/7/" promoter, Sm' 


This study 


pFL307 


phr in pFL190, Sm' 


This study 


doi:l 0.1 371 /journal.pone.0096953.t001 
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Figure 1 . Pulsed field gel electrophoresis of chromosomal plugs of P. profundum strains SS9, DSJ4 and 3TCK. The plugs were digested 
overnight with /-Ceul and run under the following conditions: (A) Voltage: 6 V/cm; Pulse-time: 3-18.5 s; Runtime: 18.5 h; Temperature: 14 C; Included 
angle: 120°; Gel: 0.95% PFGE agarose in 0.5x TBE (B) Voltage: 6 V/cm; Pulse-time: 6.7-54 s; Runtime: 28 h; Temperature: 14 C; Included angle: 120'; 
Gel: 0.95% PFGE agarose in 0.5x TBE. 
doi:1 0.1 371 /journal.pone.0096953.g001 



SS9 was also retrieved from NCBI under BioProject accession 
number PRJNA13128. The number of ribosomal RNA operons 
was estimated by Pulsed Field Gel Electrophoresis of genomic 
DNA digested with I-CcmI as previously described [25]. The draft 
assembly was submitted to the NCBI PGAAP (http:/ /www.ncbi. 
nlm.nih.gov/genomes/static/Pipeline.html) for prediction of 
Open Reading Frames (ORFs) and automatic annotation. For 
genomic sequence comparisons the scaffolds of 3TCK were 
oriented and joined in alignments to the reference genome of SS9 
with the 6-frame stop-codon spacer 'NNNNCACACACTTAAT- 
TAATTAAGTGTGTGNNNN' using the custom peri script 
scaffolding.pl to create a contiguous pseudomolecule. Scaffol- 
ding.pl and other custom perl scripts used in this study are 
available at https://github.com/flauro/3tck_comparative. 

Bioinformatic Analyses 

The assignment of ORFs to Clusters of Orthologous Groups 
(COGs) and statistical comparisons were performed as previously 
described [26] using the method of Rodriguez-Brito [27] with a 
subsample size of 4000 and 10,000 bootstraps. The average 
nucleotide identity (ANI) between the genomes was computed as a 
reciprocal two-way average with the method of Goris et al. [28] 
using custom perl scripts and the following parameters: fragment 
size 1020 bp; minimum identity 30%; minimum alignable region 
714 bp. 

The ratio of non-synonymous substitutions per non-synony- 
mous site to synonymous substitutions per synonymous site (co) was 
computed for every pair of orthologous genes using a custom perl 
pipeline. Briefly, orthologous gene-pairs were found using the 
reciprocal smallest distance algorithm [29], aligned with MUS- 
CLE [30], and ff) was calculated using the KaKs calculator [31] 



with the YNOO method [32]. The statistical significance of 
orthologous pairs with <n> 1 was assessed with a Fisher exact test. 
The time of divergence between the strains (t) was estimated from 
the formula T = Ks/(2*>^) where X = 8.3x10"' SNPs/site/year 
[33] medianed across all pairs of orthologs. Codon usage was 
calculated as described by Karlin [34] using a custom perl script. 
The relevant biases in codon usage were identified using the 
methods described in [35], [36]. Briefly, the codon usage of each 
gene is compared to the genome-wide mode of codon usage, and 
the significance is established using a Chi-square test. The p-value 
threshold was set to 0.1. 

Cloning Experiments 

All restriction enzymes were purchased from New England 
Biolabs (Beverly, MA, USA). AU the PGR amplifications were 
performed using the Expand Long Template PGR system (Roche 
Applied Science, Indianapolis, IN, USA). 

The genes conferring UV resistance were cloned in pFL122 as 
follows. A fosmid clone (GCLNU_G4) from the P. profundum 
3TCK sequencing library containing P3TCK_10673 i^-poX; RNA 
polymerase sigma factor, ECF subfamily), P3TCK_10668 (Con- 
served Hypothetical Protein), P3TCK_10663 (phr, deoxyribodi- 
pyrimidine photolyase) was cut with Xho\-\-KpA. The 7.2 kbp band 
contained the genes of interest and was gel purified and ligated in 
pFL122 [37] cut with Xhol+Kpnl yielding pFL303. The deletion 
A22 was obtained by cutting pFL303 with iscoRI and re-ligating, 
which effectively eliminates P3TCK_10673, most of 
P3TCK_10668 and the region with the two divergent promoters 
between the two. This deletion construct was named pFL304. The 
promoter region was PGR amplified from pFL303 using primers 
PR0MPH02F (5' - GTCGAATTCCTTTTCTTGCAGCGT- 
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Figure 2. Chromosome organization in P. profundum strains. ACT nucleotide comparison [4] between the cPiromosomes of the two 
bathytypes SS9 and 3TCK. From top to bottom: (A) P. profundum 3TCK ch romosome 1 (B) P. profundum SS9 chromosome 1 (C) P. profundum SS9 
chromosome 2 (D) P. profundum 3TCK chromosome 2. 
doi:1 0.1 371 /journal.pone.0096953.g002 




Figure 3. Gene content comparison in P. profundum strains. 

Shared gene comparison between the two bathytypes SS9 and 3TCK. 
The larger Venn diagram shows the number of orthologous genes 
shared and the number of genes unique to each bathytype. The smaller 
pie charts are the proportion of unique genes to each bathytype with 
(light blue) or without (dark blue) matches to the COG database. The 
matching genes were assigned to COG categories (from top to bottom): 
E-Amino acid transport and metabolism (red); G-Carbohydrate transport 
and metabolism (green); N-Motility and chemotaxis (violet); S-Function 
unknown (cyan); R-General function prediction only (orange); L-DNA 
Replication, Recombination and Repair (blue); T-Signal Transduction 
(pink); K-Transcription (light green); Other categories (grey). 
doi:1 0.1 371 /journal.pone.0096953.g003 



CAGT - 3') and PR0MPH02R (5' - GTCGAATTCTAG- 
TAAGCGAATAGCAGGAC -3'). 

Similarly the promoter region and the whole length 
P3TCK_10673 was amplified with primers SIGMAPH02F (5' 
- GTCGAATTCGTATTCAAGATGGGCACTCA - 3') and 
the same reverse primer as above. Both amplicons were digested 
with EcoBJ and cloned in the EcoBJ site of pFL304 yielding 
pFL305 (promoter only) and pFL306 (promoter and 
P3TCK_10673) respectively. The directionality of the inserts 
was checked by PGR and confirmed by standard thermal cycle 
dideoxy sequencing with fluorescentiy labelled terminators (Ap- 
phed Biosystems, Foster City, CA, USA). 

For the arabinose-inducible UV resistance experiments, the phr 
gene, inclusive of its ribosome binding site, was amplified with 
primers expPH02F (5' - ATGGCCGTCTGCAA- 
GATCCTGTA -3') and expPH02R (5' - GCTCTAGAGC- 
CACCCATTCATACGATGTGC - 3'), digested with EcoRI+ 
Xhal and cloned in the expression vector pFL190 [37] cut with the 
same enzymes. 

In vivo Photoreactivation 

The effect of ultraviolet light on the survival of P. profundum 
strains was tested as follows. Serial dilutions of late exponential 
cultures were plated on 75% strength 2216 Marine Agar. For each 
strain a triplicate dilution series was prepared: one untreated, one 
UV irradiated without blue light recovery, and one UV irradiated 
followed by a recovery period under blue light. 
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Figure 4. COG analysis comparison of the gene content in the two bathytypes. Reported are the median differences after resampling as 
described in material and methods. In pink the categories that were statistically over- or under-represented in one of the bathytypes at 98% 
significance. 

doi:1 0.1 371 /journal.pone.0096953.g004 



The cells were irradiated uncovered using a germicidal lamp 
with an emission peak at 253.7 nm (Philips G25 T8), for 
10 seconds at a power of 220 |a.W/cm2. 

For the photoreactivation, the Petri dishes were then covered, to 
filter out the shorter wavelength radiation, and allowed to recover 
for 1 hour under "black" light (Philips TLD 15 W/08), emitting in 
the 350-400 nm range, at an irradiance of 20 |J,W/cm2. 
Irradiances were measured with a Spectroline DM-365 XA digital 
radiometer (Spectronics corp., Westbury, NY, USA). 

The plates were wrapped in foil and grown at 15°C for 5 days 
after which c.f u. were counted and the number of colonies in the 



irradiated samples were compared with the untreated controls to 
calculate the percent survival. 

In all experiments, cell transfers and manipulations were 
performed under General Electric "gold" fluorescent light to 
prevent uncontrolled photorepair. 

Results and Discussion 

Bacteria and Archaea can be transported vertically through the 
water column as a result of attachment to sinking particles (see, for 
example [38]) and migrating metazoans or other phenomena such 
as advective transport [17]. Growth and survival at different 




•er©- 



Figure 5. Correspondance analysis of codon usage for the ORFs of P. profundum 3TCK. The ORFs belonging to the phr gene cluster (in the 
insert) are color coded. The green triangle represents P3TCK_10668, the yellow square P3TCK_10673, and the blue diamond P3TCK_1 0663. The codon 
usage of P3TCK_10663 and P3TCK_10673 suggests their recent acquisition by horizontal gene transfer. 
doi:1 0.1 371 /journal.pone.0096953.g005 
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Figure 6. Introduction of the phr gens cluster from the shallow bathytype 3TCK into the deep bathytype SS9 confers UV resistance. 

TInis phenotype is not observed in the deletion construct that lacl<s the upstream hypothetical protein and rpoX gene (pFL304). The UV resistance 
phenotype can be partially restored by re-adding to pFL304 the promoter region of the cluster (pFL305) but is completely restored only when both 
the promoter and the rpoX sigma factor are added (pFL306). The absence of P3TCK_10668 (Hypothetical Protein) does not affect UV sensitivity. The 
UV survival plots present the ratio of c.f.u. observed after UV exposure followed by blue-light photoreactivation (red) compared to the non- 
photoreactivated controls (blue) as described in the materials and methods. 
doi:1 0.1 371 /journal.pone.0096953.g006 



depths requires adaptation to many depth-correlated chemo- 
physical parameters (e.g. light, hydrostatic pressure, organic 
carbon). For example, it has been shown that adapting to a 
higher hydrostatic pressure requires adjustments to membrane 
structure, DNA synthesis, translation, and protein quaternary 
structure [39]. Pressure also affects gene regulation at the level of 
transcription [12], [13] and translation [40]. The concentration of 
nutrients varies greatly as a function of depth and it is possible that 
marine bacteria use pressure as a proxy for depth in gene 
regulation to respond to differences in nutrient availability. The 
switch between different outer membrane porins as a function of 



pressure, which has been observed in P. profundum SS9, is likely a 
result of such a response. Conversely, adaptation to shallow waters 
would require the acquisition of novel genes to cope with unique 
stressors, such as UV light. Many of these features are evident 
from the genome comparisons of different bathytypes of P. 
profundum presented here. 
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Figure 7. The UV resistant pKienotype depends uniquely on the levels of expression of the phr gene (P3TCK_10673). Cloning of the 
phr gene under the arabinose-inducible promoter of pFL190 confers UV-resistance to the cells only when grown with 0.1% L-arabinose. The UV 
survival plots present the ratio of c.fu. observed after UV exposure followed by blue-light photoreactivation (red) compared to the non- 
photoreactivated controls (blue) as described in the materials and methods. 
doi:1 0.1 371 /journal.pone.0096953.g007 



General Features of the Draft Genome of Photobacterium 
profundum Strain 3TCK and Comparisons with the 
Genome of the Previously Sequenced Strain SS9 

The draft genome of the shallow bathytype Photobacterium 
profundum 3TCK contains 1 1 scaffolds for a total length of 
6,186,725 bp with average 41.3% GC encoding for a total of 5549 
ORFs. Gene synteny plots and the existence of two different 
origins of replication [41] suggest that, similar to other members of 
the family Vibrionaceace [42], the genome is organised in two 
chromosomes. This size and structure is comparable to that of the 
previously sequenced deep bathytytpe SS9 [11], but appears to 
lack an 80 kb dispensable plasmid specific to SS9 [12]. 

The genome encodes for a complete set of tRNA synthetases 
and shares with SS9 the peculiarity of having the genes for the 
synthesis of selenocysteine and its incorporation into proteins. Like 
SS9 the genome also encodes for two complete FqFi-ATP- 
synthases and a type A FAS/PKS system [43] for the synthesis of 
polyunsaturated fatty acids such as eicosapentaenoic acid (EPA; 
20: 5n-3). 

P. profundum 3TCK has larger-than-average intergenic regions 
(~167 bp), a feature shared with most sequenced piezophUes [5], 
although the size of the intergenics is smaller than in the deep 
bathytype SS9 (~205 bp). These large intergenic regions have 
been shown to be transcribed and differentially expressed as a 
function of pressure [13], suggesting they could play an important 
physiological role. 

P. profundum 3TCK contains at least 9 copies of the ribosomal 
RNA operon (rrn; Figure 1) a number which is larger than the 
median for microbial genomes [44]. Interestingly, SS9 still holds 
the record for the highest copy rrn number in a single genome 
with 15 copies. These operons display intragenomic variation in P. 
profundum SS9, while they are almost identical in 3TCK. The 
variability is concentrated in specific loops of the 16S and the 23S 
rRNA subunits [10], [45] and is predicted to contribute to the 
ribosome stability or function at high-hydrostatic pressure. 

ACT comparisons [46] between the nucleotide sequences of the 
two strains highlights the presence of a number of insertions/ 



deletions, multiple inversions across the origin/ terminus of both 
chromosomes, but only a limited number of translocations across 
the chromosomes (Figure 2). 

The identity between the 16S gene sequences of 3TCK and SS9 
is 98.73%, which suggests the strains belong to the same species, 
however the ANI between 3TCK and SS9 is 92.85% with a 
percent conserved DNA of 62.68%, which is well below the species 
threshold for genome-level comparisons (ANI>95%; conserved 
DNA>69%; [28]). The largest proportion of genes unique to each 
genome is located on chromosome 2 (Figure 2 and Figure 3). 
Within the family Vibrionaceae this chromosome has been 
previously implicated in gene capture for environmental adapta- 
tions during the colonization of new niches [42] . 

The global analysis of Ka/Ks ratios identified only four gene 
pairs with a)>l, but none of these was statistically significant 
determined by a Fisher exact test (P<0.01). The median time of 
divergence between the strains was 126,833 years ago which is 
remarkably comparable to the time of establishment of the 
modern thermohaline circulation. This network of advective 
currents is the predicted cause behind the spatial separation of 
the 2 bathytypes. 

The extreme synteny, large proportion of insertions and 
deletions in association with a low number of sequences with 
(D> 1 and the relatively short divergence time suggests a larger role 
for HGT rather than sequence substitutions in the evolution of 
bathytypes. Furthermore, it is a clear incUcation that the strains are 
currently undergoing adaptive radiation driven primarily by gene 
acquisition and loss. 

COG comparisons between the two strains revealed a statistical 
over-representation in the shallow bathytype 3TCK of genes for 
energy production (COG category C,) but a significant decrease in 
genes for motility and chemotaxis (N) and DNA replication, 
recombination and repair (L) (Figure 4). 

The abundance of genes for category L (DNA replication, 
recombination and repair) in the deep bathytype SS9 is due to the 
large number of transposable elements. The amplification of 
mobile elements seems to be a distinctive feature of all sequenced 
deep-sea genomes and has been observed in metageiiomic surveys 
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of different depths in the water column [3], [47]. The wide 
diversity of the identified transposases in the deep-sea samples that 
could not be accounted for by biases in community composition 

led DeLong et al. [47] to hypothesize that the over-representation 
of transposable elements relates to the slower growth and smaller 
effective population size of deep-sea microbial communities. 
Compatible with this hypothesis, the 206 COG-categorized 
transposable elements found in SS9 belong to 11 families, the 
most numerous of which (COG,S436) has as many as 72 members. 
On the other hand, 3TCK encodes for just 3 COG-categorized 
transposable elements. Taken together these data support the 
hypothesis of intra-genomic amplification of transposable elements 
in the deep-sea due to habitat differences. Nevertheless, a 
transposon mutant with a cold-sensitive phenotype has been 
isolated in SS9 [48] suggesting that these mobile elements could 
have a functional role. 

In P. profundum SS9 the genes for motility and flagellar assembly 
are arranged in two large clusters, one that is shared with 3TCK 
and a second one that is unique to the piezophile SS9 as a result of 
a large contiguous deletion in the genome of 3TCK. This deletion 
accounts for the under-representation of genes from COG 
category N (Motility and Chemotaxis) in the genome of 3TCK. 
This second cluster is most similar to a lateral flagella gene cluster 
present in some Vibrio strains [49] and its role in motility and 
chemotaxis has been previously analysed [,50] comparing the 
piezosensitive motility of 3TCK to the piezoresistant one of SS9. 

Another relevant genomic island encompasses genes 
PBPRA2666-PBPRA2712. This gene cluster is involved in cell- 
envelope biosynthesis and was initially suggested to be missing in 
3TCK during a previous microarray study [12]. However, the 
genome sequence of 3TCK does, in fart, contain a similar and 
syntenic cluster (P3TCK_26512-P3TCK_26972), but with a 
highly divergent sequence. Interestingly, the cluster in SS9 
contains at least 3 genes (PBPRA2678, PBPRA2681, PBPRA2684) 
that cause a cold-sensitive phenotype when inactivated [48], 8 
genes that are regulated by temperature or pressure at the level of 
transcription (PBPRA2689, PBPRA2691, PBPRA2692, 
PBPRA2697, PBPRA2701, PBPRA2702, PBPRA2707, 
PBPRA2710) [12], [13] and 2 at the level of translation 
(PBPRA2686, PBPRA2687) [40]. 

Conversely, the over-representation in 3TCK of genes for 
Energy Production and Conversion (C) is caused by the expansion 
of COGs involved in anaerobic respiration of nitrate such as the 
periplasmic nitrate reductase napABCEG (COG0243, COG3043, 
COG3005, COG4459, COG1145), formate dehydrogenase 
(COG1526), cytochrome c5,53 (COG2863) and a two-component 
response regulator (COG2197) specific for nitrate reduction 
(P3TCK_05837). In addition chromosome 2 of 3TCK contained 
a 12 kb genomic island encompassing ORFs P3TCK_02186- 
P3TCK_02126) that encodes for the alpha (COG0804), beta 
(COG0832) and gamma (COG0831) subunits of urease and its 
accessory proteins. These genes are arranged in a single operon 
that also encoded for an ABC transporter. Their sequences did not 
display an altered GC%, GC skew or codon usage suggesting that 
they had been lost by SS9 rather than acquired by 3TCK. 
Nevertheless, their presence might be reflective of differences in 
the chemistry of the more eutrophic habitat of the San Diego Bay 
sediments (S\-liere P. profundum 3TCK was isolated) \'ersus that 
associated with deep-sea amphipods (from which P. profundum SS9 
was isolated) and is a further evidence of how the genome plasticity 
of P. profundum is the key to its adaptive radiation. 



The Conversion of the Deep Bathytype to UV Resistance 

The absence of light (apart from chemUuminescence or possibly 
bioluminescence) in the deepest depths of the oceans argues for a 
selective loss of genes associated with light tolerance. In fact the 
piezophile Psychromonas species strain CNPT3 [5 1] has been shown 
to be extremely sensitive to UV radiation [52]. 

The two most common types of UV-induced lesions on DNA 
are the generation of cyclobutane pyrimidine dimers (CPDs) and 
pyrimidine-pyrimidone 6-4 photoproducts (6-4PPs) [53]. This 
damage to DNA can be repaired by multiple pathways [53], but 
photoreactivation by deoxyribodipyrimidine photolyas(;, the 
product of the phr gene, is unique in that it requires blue light 
energy to split the CPDs or the 6^PPs [54]. 

Because of this blue light requirement the genes for the 
deoxyribodipyrimidine photolyase are expected to be absent from 
the genomes of deep-sea microbes. In fact, the SS9 genome does 
not contain a phr gene and the metagenomic analysis of the 
distribution of genes in a stratified water column [47] showed 
significant over-representation of phr genes from the photic region 
when compared to samples from deeper waters. 

In contrast to SS9, other members of the family Vibrionaceae 
have been shown to primarily rely on the activity of photolyases 
for the repair on UV-induced damage [55], [56]. The genome of 
Vibrio cholerae N16961 [57] encodes for three different members of 
the cryptochrome/photolyase family [56]. The first one 
(VCA0057) functions in repairing CPDs in dsDNA [56] the 
second one (VC1814) in repairing CPDs in ssDNA [55] while the 
function of the third one (VC1392) is still unknown. A similar 
array of photolyase-like ORFs can be seen in the genomes most 
other members of the Vibrionaceae, including the draft genome of 
Photobacterium sp. SKA34 (https://moore.jcvi.org/moore/ 
SingleOrganism.do?speciesTag = SKA34) encoding for orthologs 
to all three cryptochromes/photolyases of V. cliolerae. 

The shallow bathytype 3TCK contains a. phr gene within a three 
gene cluster (P3TCK_10663, P3TCK_10668, P3TCK10673) on 
chromosome 2 with altered codon usage (Figure 5). The phr gene 
(P3TCK_10663) and the upstream hypothetical protein 
(P3TCK_10668) are part of a predicted operon with a promoter 
upstream of P3TCK_10668 driving their expression. A different 
promoter in the opposite direction drives the transcription of rpoX 
(P3TCK_10673), an ECF-type sigma factor. Because of this 
arrangement, it was hypothesised that the phr gene cluster had 
been acquired by HGT under the selective forces provided by UV 
light exposure in shallow water. 

The cluster was cloned on a broad host-range plasmid and 
introduced into SS9 (Figure 6; pFL303) resulting in approximately 
1,000 fold increase in colony forming units (c.fu.) survival after 
UV irradiation compared to the controls. This survival was 
dependant on blue-light incubation (Figure 6). A deletion 
encompassing rpoX and most of the hypothetical protein 
(P3TCK_10668) abolished photoreactivation (pFL304). If die 
promoter region is re-introduced in the right orientation into the 
deletion construct (pFL305), photoreactivation was partially 
restored, yielding approximately 100-fold more surviving c.fu. 
than the untreated controls. The fuU restoration of the UV 
resistant phenot)pe could be obtained only by cloning, in the right 
orientation, both the promoter and the sigma factor (pFL3()6). 

Based upon these ri^sults it was suggested that the gene encoding 
the hypothetical protein is dispensable for photoreactivation 
activity and the UV resistance phenotype is solely dependent on 
the level of expression of the phr gene. To test both hypotheses the 
phr gene alone was cloned in a vector (pFL190) with an arabinose- 
inducible promoter (pFL307). The full UV resistant phenotype 
was observed only after induction with 0. 1 % arabinose (Figure 7) 
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indicating that high levels of expression of the phr gene alone are 
necessary and sufficient to confer UV resistance. 

A noteworthy result of this experiment is that the functional 
activity of the phr gene cluster benefits from the presence of a 
flanking sigma factor. There is a precedent for this type of 
observation. Sometimes acquired genes must be obtained as 
clusters of functional units to overcome the barrier caused by the 
incapacity to transcribe the HGT gene at the appropriate level 
[58]. Genes providing marginal benefits, like photolyase, can also 
be readily lost from a population when the increased metabolic 
cost for replication is not balanced by selective pressure as was 
observed between high- and low-light-adapted Prochlorococcus 
strains [59]. These processes of gene cluster gain and loss generate 
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